CLASSICAL TEST THEORY vs. ITEM RESPONSE THEORY: An evaluation of the theory test in the Swedish driving-license test

Author

  • Marie Wiberg
Abstract

The Swedish driving-license test consists of a theory test and a practical road test. The aim of this paper is to evaluate which of the one- (1PL), two- (2PL) and three-parameter (3PL) logistic Item Response Theory (IRT) models is the most suitable for evaluating the theory test in the Swedish driving-license test, and to compare the chosen IRT model with the indices in Classical Test Theory (CTT). The theory test has 65 multiple-choice items and is criterion-referenced. The models were evaluated by verifying the assumptions that IRT models rely on, examining the expected model features and evaluating how well the models predict actual test results. The overall conclusion from this evaluation is that the 3PL model is preferable when evaluating the theory test. Comparing the indices from CTT and IRT, it was concluded that both give valuable information and should be included in an analysis of the theory test in the Swedish driving-license test.

INDEX

  • INTRODUCTION
  • AIM
  • METHOD: SAMPLE
  • METHOD: CLASSICAL TEST THEORY
  • METHOD: ITEM RESPONSE THEORY
      1. Verifying the assumptions of the model
          A. Unidimensionality
          B. Equal discrimination
          C. Possibility of guessing the correct answer
      2. Expected model features
      3. Model predictions of actual test results
      Estimation methods
  • RESULTS: CLASSICAL TEST THEORY
      Modeling the items
  • RESULTS: ITEM RESPONSE THEORY
      1. Verifying the assumptions of the model
      2. Expected model features
          Invariance of ability estimates
          Invariance of item parameter estimates
      3. Model predictions of actual test results
          Goodness of fit
          Normal distributed abilities
          Test information functions and standard error
          Rank of the test-takers
  • COMPARING CTT AND IRT
  • DISCUSSION
  • FURTHER RESEARCH
  • REFERENCES
  • APPENDIX
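
As a rough illustration of the three models compared in the abstract, the logistic item response function can be sketched as below. The function name `irt_probability`, the parameter names and the example values are illustrative assumptions and are not taken from the paper. The scaling constant D = 1.7 that some formulations include is omitted here.

```python
import math


def irt_probability(theta, b, a=1.0, c=0.0):
    """Probability of a correct answer under a logistic IRT model.

    theta: ability of the test-taker
    b:     item difficulty
    a:     item discrimination (held equal across items in the 1PL model)
    c:     lower asymptote, i.e. the chance of guessing correctly (0 in 1PL and 2PL)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item parameters, chosen only to show how the models nest.
models = {
    "1PL": {"b": 0.0},                      # difficulty only
    "2PL": {"b": 0.0, "a": 1.5},            # adds discrimination
    "3PL": {"b": 0.0, "a": 1.5, "c": 0.25}  # adds a guessing parameter
}

for name, params in models.items():
    for theta in (-2.0, 0.0, 2.0):
        p = irt_probability(theta, **params)
        print(f"{name} theta={theta:+.1f} P(correct)={p:.3f}")
```

With `c` fixed at 0 the function reduces to the 2PL model, and with `a` also fixed at a common value it reduces to the 1PL model, which is why the three can be compared as nested alternatives.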

Similar resources

Psychometric Properties of State Level Subjective Vitality Scale based on classical test theory and Item-response theory

The purpose of the present study was to investigate the factor structure and item-response parameters of the State Level of Subjective Vitality Scale. The research design was correlational, and the statistical population consisted of students of the Shahid Beheshti University of Tehran. A sample group of 240 students was selected through multi-stage sampling and completed Subjective Vitality ...

Psychometric Properties of the Brief Form of Professor-Students Rapport Scale-based on Classical Test Theory and Item-Response Theory

Introduction: In order to improve the quality of the teaching process, it is necessary to review the professor-student rapport. The purpose of the present study was to investigate the factor structure and item-response parameters of Professor-Students Rapport Scale-Brief (PSRS-B). Methods: In a descriptive-correlation study, 497 students from Shahid Beheshti University of Medical Sciences were ...

The Comparison of Two Models for Evaluation of Pre-internship Comprehensive Test: Classical and Latent Trait

Introduction: Despite the widespread use of the pre-internship comprehensive test and its importance in medical students’ assessment, there is a paucity of studies that provide a systematic psychometric analysis of the items of this test. Thus, the present study sought to assess the March 2011 pre-internship test using classical and latent trait models and compare their results. Methods: In th...

Selection the best Method of Equating Using Anchor-Test Design in Item Response Theory

Problem statement. The equating process is used to compare the scores of two different tests on the same subject. The goal of this research is to find the best method of equating data using the logistic model. Method. We use data from the Ph.D. test in the Statistics major for two consecutive years, 92 and 93. For the analysis, we specifically use the tests of the Statistics major ...

Psychometric properties of the Adolescent Depression Scale based on item-response theory and comparison of the results with classical test theory

Background and Aim: The objective of this study was to assess the psychometric properties of the Adolescent Depression Scale (ADS) based on the item-response theory and compare the results with those based on the classic test theory. Materials and Methods: A total of 750 students (364 males and 386 females) were selected through multistage random clustering (levels proportional to size) and ...

Publication date: 2004